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Watching Every Step of the Way: Jumn Virus Attenuation Markers in the 
Vaccine Lineage 

Betina Ines Stephan*, Mario Enrique Lozano and Sandra Elizabeth Goni 

Area de Virosis Emergentes y Zoonoticas, Laboratorio de Biologi'a Celular y Molecular, Universidad Nacional de 
Quilmes, Argentina 

Abstract: The Arenaviridae family includes several hemorrhagic fever viruses which are important emerging pathogens. 
Jum'n virus, a member of this family, is the etiological agent of Argentine Hemorrhagic Fever (AHF). A collaboration be- 
tween the Governments of Argentina and the USA rendered the attenuated Jumn virus vaccine strain Candid#l. Arenavi- 
ruses are enveloped viruses with genomes consisting of two single- stranded RNA species (L and S), each carrying two 
coding regions separated by a stably structured, non-coding intergenic region. Molecular characterization of the vaccine 
strain and of its more virulent ancestors, XJ13 (prototype) and XJ#44, allows a systematic approach for the discovery of 
key elements in virulence attenuation. We show comparisons of sequence information for the S RNA of the strains XJ13, 
XJ#44 and Candid#l of Junta virus, along with other strains from the vaccine lineage and a set of Jumn virus field strains 
collected at the AHF endemic area. Comparisons of nucleotide and amino acid sequences revealed different point muta- 
tions which might be linked to the attenuated phenotype. The majority of changes are consistent with a progressive at- 
tenuation of virulence between XJ13, XJ#44 and Candid#l. We propose that changes found in genomic regions with low 
natural variation frequencies are more likely to be associated with the virulence attenuation process. We partially se- 
quenced field strains to analyze the genomic variability naturally occurring for Jumn virus. This information, together 
with the sequence analysis of strains with intermediate virulence, will serve as a starting point to study the molecular 
bases for viral attenuation. 
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INTRODUCTION 

The Arenaviridae family comprises only one genus 
formed of 25 species of arenaviruses, as categorized by the 
ICTV [1], and listed in (Table 1). A further classification is 
based on the geography of origin and distribution of the vi- 
ruses and their hosts: Old World Arenaviruses (OWA) and 
New World Arenaviruses (NWA). The NWA group harbors 
all arenaviruses from the American continent and the OWA 
group includes the only arenaviral species present in all con- 
tinents (Lymphocytic Choriomeningitis Virus - LCMV) as 
well as the African arenaviruses. To date, there are no re- 
ports of any native viral species from Europe or Australia. 
The precise geographic distribution pattern has its origin in a 
very narrow host range that is characteristic of all arenavi- 
ruses. The principal hosts for the NWA are several specific 
members of the rodent family Cricetidae, subfamilies Sig- 
modontinae and Neotominae [2], In the case of LCMV, the 
rodent reservoir is the worldwide distributed Mus musculus, 
which explains the unusual ubiquity of this virus. One NWA, 
the Tacaribe Virus (TACV), was isolated from bats [3], in 
what was thought to be an atypical occurrence for an 
arenavirus. However, a recent discovery of new arenaviral 
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species in boas [4] opens up new avenues of inquiry regard- 
ing the viral host range and biodiversity. 

This viral family is of medical interest because 6 of the 
25 members have been consistently associated with clinical 
symptoms and described as the etiological agents of human 
diseases. All pathogenic NWA produce hemorrhagic fevers, 
with very similar physiopathology (Table 2). The phases of 
the illness are well characterized and progress from prodro- 
mal to neurologic-hemorrhagic to convalescent [5]. There is 
a short incubation period before the onset of symptoms. In 
the case of Argentine Hemorrhagic Fever (AHF), the syn- 
drome is characterized by renal, neurological, vascular, im- 
munological and hematological alterations. If it remains un- 
treated, the mortality rate can be as high as 15-30%. 

Since it was described, AHF has been reported to cause 
annual outbreaks during the autumn- winter season [6]. How- 
ever, the number of actual cases in each outbreak has signifi- 
cantly diminished since the application of the vaccine to the 
risk population within the endemic area. For the 1992-2000 
period the efficacy of the vaccine was estimated at 98% [7]. 

The development of the vaccine against AHF was the 
product of a collaboration between the US and Argentine 
governments. A greatly attenuated strain of Junin virus 
(JUNV), Candid#l, was extensively tested in rhesus mon- 
keys and human volunteers with good results. Afterwards, a 
comprehensive clinical trial was conducted in the AHF 
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Table 1. Arenaviridae family. NR: not reported; OWA: Old World Arenavirus; NWA: New World Arenavirus; the lineages are 
listed as: A, B, C and Rec A/B. The distribution is based on the country of virus isolation. Table modified from [6, 7] 



Virus 


Acronym 


Evolutionary Lineage 


Distribution 


Reservoir 


Human 
Patogenicity 




FLEV 


NWA-A 


Brazil 


Oryzomys spp. 


Yes 


Pichinde 


PICV 


NWA-A 


Colombia 


0. albigularis 


NR 


Parana 


PARV 


NWA-A 


Paraguay 


0. buccinatus 


NR 


Allpahuayo 


ALLV 


NWA-A 


Peru 


Oecomys bicolor 


NR 




PIRV 




Venezuela 




NR 


Juin'n 


JUNV 


NWA-B 


Argentina 


C. musculinus 


Yes 


Machupo 


MACV 


NWA-B 


Bolivia 


C. callosus, C. laucha 


Yes 


Guanarito 


GTOV 


NWA-B 


Venezuela 


Z. brevicauda 


Yes 




SABV 




Brazil 




Yes 


Chapare 




NWA-B 


Bolivia 


Unknown 


Yes 




LATV 




Bolivia 




NR 


Tacaribe 


TCRV 


NWA-B 


Trinidad 


Artibeus spp. (bat) 


Yes 


Cupixi 


CPXV 


NWA-B 


Brazil 


0. capita 


NR 


Amapari 


AMAV 


NWA-B 


Brazil 


0. capita-Ncacomys guianae 


NR 


Oliveros 


OLVV 


NWA-C 


Argentina 


Bolomys obscurus 


NR 


Whitewater Arroyo 


WAV 


NWA-Rec A/B 


USA Southwest 
(NM,TX,UT,OK) 


Neotoma albigula, N. mexicana, 
N. micropus, N. sinerea 


Yes 


Lassa 


T A Q\T 
LA^ V 


OWA 


Nigeria, Ivory Coast, 
Guinea, Sierra Leone 


Mastomys sp. 


Vac 

i es 


Tamiami 


TAMV 


NWA-Rec A/B 


Florida, USA 


Sigmodon hispidus 


NR 


Bear Canyon 


BCNV 


NWA-Rec A/B 


California, USA 


Peromyscus sp. 


NR 


Lymphocytic Choriomeningitis 


LCMV 


OWA 


Worldwide 


Mus musculus 


Yes 


Mobala 


MOBV 


OWA 


Central African Republic 


Praomys sp. 


NR 


Mopeia 


MOPV 


OWA 


Mozambique 


Mastomys natalensis 


NR 


ippy 


IPPYV 


OWA 


Central African Republic 


Arvicanthis sp. 


NR 


Lujo 


LUJV 


OWA 


South Africa 


Unknown 


Yes 





Table 2. Arenaviral hemorrhagic fevers pathological characteristics. The name of the disease (when existing), along with the prin- 
cipal clinical symptoms caused by Old World Arenaviruses and New World Arenaviruses are briefly explained. Modified 
from [13] 



Agent (Disease) 




Old World 

Lassa (Lassa Fever) 


Incubation: 3 to 21 days. Fever, headache, myalgia, backache, trembling and sickness. 

Generalized infection: hemorrhagic dissemination of virus to several organs and systems via bloodstream, lymphatic 
system, respiratory and digestive tract. 

Black vomit, aqueous diarrhea (dehydration), quantity decrease of lymphocytes and platelets, mild thrombocytopenia, 
abdominal, pleuritic and hepatic pain. 

Extensive reticulo-endothelial compromise: capillary injuries causing stomach, small intestine, kidney, lung and brain 
bleeding. 

Multifocal hepatocellular necrosis with Councilman-like bodies, hepatocytes cytoplasmic degeneration and minimal 
inflammatory response. 

Adrenal focal necrosis and cytoplasmic inclusions. 

Respiratory system: interstitial pneumonia, cough, dyspnea, bronchitis, pneumonia and pleurisy. 

Cardiovascular system: pericarditis, tachycardia, bradycardia, hypertension, hypotension, thrombocytopenia, leukopenia 
and hiperuricemia, lymphadenopathy, elevated aminotransferases, decreased prothrombin level, disorder of blood circu- 
lation and bleeding through the skin, lungs, gastrointestinal tract and other mucosa membranes. 
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Agent (Disease) 



New World 

Junin (Argentine Hemor- 
rhagic Fever) 

Machupo and Chapare 
(Bolivian Hemorrhagic Fever) 

Guanarito (Venezuelan Hem- 
orrhagic Fever) 

Sabia (Brazilian Hemorrhagic 
Fever) 

Whitewater Arroyo 
Flexal 



Pathological Characteristics 




Incubation: 6 to 14 days. First 4 days: decaying, fever, anorexia, nausea and vomiting, headache i 
stage: acute hemorrhagic syndrome (epistaxis and hematemesis), or acute neurologic syndrome. ( 
fever, severe myalgia, anorexia, back pain, abdominal tenderness, conjunctivitis, retro-orbital pain, ph 
usually constipation. 

Acute phase of infection: peripheral blood viral active replication. Oropharyngeal enanthem. 
Gums swollen, congested and bleeding (gingival border). Proteinuria high dehydration and hemoconcentration 
women, early menorrhagia. 

Multifocal hepatocellular necrosis with formation of Councilman-like bodies, nuclear pyknosis, cytoplasmic eosino- 
philia, cytolysis, inflammation and a mild cellular infiltration composed of mononuclear cells and neutrophils. 
Kidney damage: distal tubular cells and collecting ducts. Glomeruli or proximal tubules. In a few cases, renal failure 
was reported. 

Cardiovascular system: postural hypotension and relative bradycardia, arrhythmias are transient and benign. Different 
degrees of dehydration, uremia, proteinuria, hematuria and oliguria. 

Respiratory: dry cough, without sore throat. Pharyngeal enanthem. Interstitial or bronchial pneumonia, pulmonary 
edema and hemorrhage. 



endemic area [8, 9]. A diagram with the passage history that 
led to the attenuated JUNV Candid#l strain is shown in (Fig. 
1). The intermediate strains were named XJ#, in reference to 
the original strain and the number of passages in mouse 
brain. The vaccine was originally produced in the USAM- 
RIID (United States Army Medical Research Institute for 
Infectious Diseases) facilities. In 2005, the National Institute 
of Human Viral Disease (INEVH) located in Pergamino, 
Argentina, finished the certification process and took over 
the vaccine production for local use. 

Other vaccines against Arenaviruses have not been re- 
leased. Significant advances were made to develop an effec- 
tive protection against Lassa fever, which is a disease en- 
demic to the West-African region with 3-5xl0 5 cases re- 
ported annually [10]. Its apparently expanding territory, to- 
gether with the imported cases reported yearly all over the 
world, make this virus the biggest health threat of the 
Arenavirus family [11]. Several different approaches were 
used to try to achieve protective coverage with a Lassa virus 
vaccine. However, the necessity of a single-dose treatment 
and the great diversity among the many viral strains 
converge to increase the difficulty of this task [12]. 

Arenaviruses are enveloped viruses with a bipartite, sin- 
gle stranded RNA genome [14, 15]. The virion contains 
three structural proteins (L, N and Z) while the lipid enve- 
lope carries the three-piece glycoprotein (SSP, Gl and G2). 
The larger genomic segment (L RNA, ~7kb) encodes the 
RNA dependent RNA polymerase (L protein), which is the 
minor component of the virion, as well as Z, a Zinc finger- 
protein of 1 1 kDa that could be the arenaviral counterpart of 
a matrix protein [16]. The other genomic segment (S RNA, 
-3,5 kb) codes for the nucleocapsid protein N and the glyco- 
protein precursor (GPC), which is processed by cellular pro- 
teases into three parts. The arrangement of the two open 
reading frames in opposite orientations, separated by the 
stably structured, non-coding intergenic region, gave rise to 
the term "ambisense coding" [17]. The first 19 nucleotides at 
each end of one segment are complementary, which allows a 
base-pairing that forms a panhandle structure highly con- 
served among arenaviruses [18]. 



SEQUENCE ANALYSIS OF THE VACCINE- 
RELATED JUNV STRAINS 

In order to identify and characterize changes directly 
related with the attenuated phenotype of the JUNV vaccine 
strain, the complete nucleotide sequence of all available in- 
termediate strains (XJ13, XJ17, XJ34, XJ39, XJ#44, XJ48 
and Candid#l) from the vaccine genealogy were analyzed, 
along with a small number of field strains [19, 20]. 

A comparison of the sequence of the vaccine genealogy 
strains revealed a set of differences that might be associated 
with the decrease in virulence (Fig. 2). An alignment of the S 
RNA-derived coding sequences disclosed eleven individual 
changes which translated into amino acid substitutions. The 
first of these changes was located in the stable signal peptide 
(XJ13 I 35 ^ XJ17 I->XJ34 I->XJ39 I->XJ#44 V 35 ^ XJ48 
I->Candid#l V), the following three were in the mid-part of 
Gl (T 168 ^A->A->A->A->A->A; E 186 ^E^E^E^E^E->G 
and S 206 ->S->S->S->S->S->P), two more were found at 
the carboxyl terminus of G2 (F 427 ->F->F->F->F->F->I and 
T 446 ->T->S->S->T->S->S), and the last five were at the 
amino half of N (V 47 ->V->V->V->V->V->E; R 59 ->K->K-> 
K->R->K->R; I 158 -H-H-H->V-M->V; E 268 ^E->E->E->E 
->E->D and T 322 ->T->T->T->I->T->I). The same analysis, 
performed on the coding sequences derived from the L gene, 
revealed only nine nucleotide changes implied in amino acid 
substitutions. Two of these are reversions (XJ13 R 881 ->XJ#44 
G->Candid#l R; S 921 ->G->S), while the other seven might 
be associated with the attenuated phenotype (H 76 ->Y->Y; 
V 415 ^V^A; D 462 ^N^N; L 936 ^L^P; R 1156 ^K^K; 
S 1698 ->S->F and I 1883 ->I->V). The Z gene showed no nucleo- 
tide changes among these three JUNV strains. 

This analysis was extended to include two other com- 
pletely sequenced JUNV strains, Romero and MC2, as well 
as a partial sequence from the Cba-IV4454 strain (S RNA 
and Z gene). The first strain is classified as of high virulence 
and was originally isolated from an AHF patient [21, 22], In 
the Genebank its name is mistakenly spelled "Rumero". The 
second strain (MC2) is of intermediate virulence and was 
isolated from a rodent captured in the AHF endemic 
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Strain 



Passage History 



Characteristics 



C 

C 

c 



Isolated by Parodi in 1958 
from an AHF patient in Junin 
city 

XJ ± 2 passages in guinea pig 
XJ#44 ^ passages in mouse brain 
amplified by one round of 
mouse brain injection 

Adaptation 
12 passages without viral 
Candid#l dilution in FRhl-2 cells. 
(Master Seed) Attenuation 

4 passages in FRhl-2 cells: 
(Pseudo single burst, dilution, 
amplification) 



#1 Amplification: 1 passage 

(Secondary Seed) FRhl-2 Cells. 

Candid#l Amplification: 1 passage 

il FRhl-2 cells. 



Virulent 



Attenuated for guinea pig 
and virulent for mouse 
Lethality index: 1,37 ±0,45 



Attenuated for guinea pig, 
mouse and human. 
Lethality index: 3,87 ±0,42 



Fig. (1). History of passages for the Junin virus vaccine strain Candid#. Adapted from [11]. 
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Fig. (2). Summary of the changes found in Junin virus proteins. A. S RNA; the N and GPC ORFs are shown as arrows, indicating the trans- 
lation sense. The strains used for the comparison are shown on the left side. The parts of GPC are marked below the diagram. Amino acid 
changes among the strains (XJ#13, XJ#17, XJ#34, XJ#39, XJ#44, XJ#48 and Candid#l) are drawn as lines above the gene, followed by the 
identity of the change and its position. Variations among the field strains (XJ#13, Romero, IV 4454 and MC2) are drawn as lines below the 
gene. B. Relative frequency of mutation for the different types: black is for type- land grey for type-2. 



area [23-25]. The Cba-IV4454 strain was also obtained from 
a patient and is considered of intermediate virulence. The 
analysis was performed on deduced amino acid sequences 
for the six aforementioned JUNV strains. Two distinct types 
of mutations were identified, according to a specific defini- 
tion. Type 1 mutations are those where only one of the 
JUNV field strains (XJ13, Romero, IV4454, and MC2) dif- 
fers from all others. Type 2 mutations refer to the positions 



with changes among the vaccine-related strains (XJ13, 
XJ#44 and Candid#l). This last type allows identifying key 
positions that might be involved in the attenuation process, 
while the Type 1 mutations take into account the naturally 
occurring variations. An overlap of these two types of 
changes uncovered several mutations with probably little 
significance for the viral virulence. Positions GPC 35 , N 15g , 
N 2 68 an d N 322 all had changes among the six sequences that 
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varied between two particular amino acids, regardless of the 
strain's origin. Something similar happened with positions 
L 881 and L 921 , where the only difference was found in XJ#44. 

A mutation frequency index was calculated, defined as 
the number of mutations per amino acid in an overlapping- 
window strategy, with the resulting value assigned to the 
central residue. The analysis was made for both types of mu- 
tations for the complete JUNV genome sequence, using in- 
house software. This global plotting approach gives an over- 
view of the distribution of the mutations along the genome. 
Several possible hot-spots were identified, that is, regions of 
the sequence with a marked tendency towards natural varia- 
tion. Also, a small number of type 2 mutations were found 
outside the hot-spots, namely GPC 168 , GPC 427 , GPC 446 and 
N 47 for the S RNA and positions L 76 , L 936 , L 1156 for the L 
RNA. These changes could more confidently be assigned to 
the virulence attenuation process. 

CODING REGIONS 

A further search for Type 2 mutations was performed on 
a set of field samples, isolated from human cases or rodents, 
captured in the endemic area during the period of 1963-91. 
Out of the genome, four regions were selected for this analy- 
sis: i) G fragment, comprising positions 303 to 941 within 
the GPC coding sequence; ii) N fragment, spanning nucleo- 
tides 1632 to 2095 of the nucleoprotein coding sequence and 
two regions within the open reading frame of the RNA de- 
pendent RNA polymerase [13]. The fragments were ampli- 
fied by PCR and sequenced for 13 different JUNV field 
strains. Afterwards, the sequences were translated in silico, 
aligned and scanned for positions with differences between 
strains. A part of the Type 2 mutations obtained for the G 
fragment are shown in (Table 3). The selected region com- 
prises most of Gl, the consensus site for the arenavirus gly- 
coprotein processing [26] and the beginning of G2. Some of 
the variants present among the field samples (from both hu- 
mans and rodents) were also found among the attenuated 
JUNV strains. Consequently, the importance of these posi- 
tions in the attenuation process should be re-evaluated. 

The development of a reverse genetics system made it 
possible to directly test the role of the different variable posi- 
tions in viral fitness and virulence. Albarino et al. used chi- 
meric genomes, baring a combination of XJ13 and Can- 
dida-derived sequences, to assess the importance of each 
portion of GPC. They found that only the Candid#l -derived 
G2 part was critical for the attenuated phenotype. A further 
characterization mapped the significant points to the trans- 
membrane domain within G2 [27]. However, a number of 
variable positions remain to be evaluated by this experimen- 
tal method. 

Previous studies described four conserved regions in the 
L protein of all arenaviruses [28]. The polymerase domain 
was mapped to the region III, with the presence of four con- 
served motifs: A, B, C and D [29]. The variation analyses 
showed only one change in this region, at position 1156 
(R— >K— >K). Albeit the mutation has a conservative charac- 
ter, it is probably related to the attenuation process because 
there are no natural variants in this region. The other changes 
in the L ORF are located between regions II and III (L 93 6, 
L— >L— >P) and in region I (L 76 , H— >Y— >Y). The modification 
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of a Leucine for a Proline brings about a mayor structural 
change and is present only in the Candid#l strain, in spite of 
being immersed in a region of high type 2 mutations index. 
The change at position L 76 is near a predicted ATP/GTP 
binding site (www.expasy.org). Even though these results 
are preliminary, they suggest an important role for the po- 
lymerase in the virulence attenuation, in accord with reports 
for other viruses [29-31]. Also, some of the mutations in the 
structural proteins (nucleoprotein and the two glycoproteins) 
could be associated with the attenuation of virulence. The N 
protein has a highly conserved carboxyl-terminus, containing 
a zinc binding domain [32]. The mutation at N 47 (V— >V— >E) 
lies outside that region, but could be related to other charac- 
teristics of the protein, such as the oligomerization capability 
[33], possibly interfering with one or both of N's functions 
in the viral life cycle. In the genome replication, N promotes 
the synthesis of antigenomic, full-length copies of the S 
RNA. In the assembly of the virion, the protein binds the 
genomic RNA, forming the nucleocapsid. The other struc- 
tural protein, GPC, sported a mutation at the carboxyl end of 
Gl (GPCiss, T— >A— >A), directly affecting the conserved 
sequence (N 166 R 167 Ti 6 8Ki 6 9) for a predicted N-glycosylation 
site (NETGLYC 1.0). For the G2 portion, three domains 
have been described. First, the outer domain, located at the 
carboxyl end, which is outside the virion and interacts with 
Gl; second, the transmembrane domain, which interacts with 
the stable signal peptide, and last, the inner domain, situated 
at the amino end, which could interact with the nucleocapsid 
protein or the Z protein, on the inside of the virion [26, 34]. 
The changes within G2 are in the transmembrane domain 
(GPC 42 7, F— >F— >I) or in the cytoplasmic tail (GPC 446 , 
T— >T— >S) and, in both cases, could affect these important 
interactions. 

The potential attenuation markers were found to be un- 
evenly distributed in the genomic segments, with a higher 
degree of conservation in the L RNA, which indicated a 
faster evolution rate for the polypeptides encoded in the S 
RNA. In a comparison of genomes of virulent and avirulent 
strains of Pichinde arenavirus, Lan and collaborators [29] 
found that attenuation related mutations mapped to compara- 
ble genomic regions, but were fewer in number. If the com- 
parison is limited to only the field strains of Jumn virus 
(XJ13, Romero, MC2 and IV4454) the mutation distribution 
pattern became markedly different for the large and small 
genomic segments. In a protein sequence analysis of the field 
strains, 45 divergent sites were found for the S RNA derived 
ORFs and 48 sites for the L RNA derived ORFs. Interest- 
ingly, for the L RNA, 46 out of the 48 sites were XJ13- 
specific changes, while for the S RNA, this was only the 
case in 4 of the 45 sites. 

As stated before, the G fragment included the sequence 
for the cleavage site between Gl and G2. The recognition 
site sequence has been published as QLPRRSLK251AFF [35] 
with the cleavage occurring between K251IA or LJ,K 2 5i [36]. 
Most of the analyzed samples didn't have mutations in that 
region. Only isolate H_FHA5054 presented changes at posi- 
tions 244 and 245 (Q— >H and L— >F respectively), but it is 
not certain if these modifications would have any detectable 
outcome. A portion of the variation observed for the G frag- 
ment, namely the part corresponding to Gl, could be associ- 
ated with the host jump of a rodent strain to humans (recep- 
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Table 3. G fragment positions with variations in amino acid sequence. The changes with regard to the XJ13 strain are annotated 
in bold letters. Type 2 mutations among vaccine related strains are shadowed in gray. Adapted from [13] 



Residue Position 


X.I 1 3 


X.I17 


XJ34 


XJ39 


XJ#44 


XJ48 


Candid#l 


MC2 


IV4454 


Romero 


AN_8640 


in 

90 
1/1 

1 

Z 
< 


AN_13365 


AN_16501 


AN_17058 


AN_17246 


AN_17116 


AN_17064 


- 

1 

B 


H_FHA5069 


H_pl879 


H_FHA5054 


o 

I 

a 


107 


















T 






s 














S 


T 


T 


T 


s 


109 


n 

V 
















M 


M 




Q 




M 


n 

V 




M 


M 


Q 


M 


M 


M 


Q 


111 


















T 






s 






T 








s 


T 


T 


T 


s 


116 


A 


A 


A 


A 


A 


A 


A 


A 


A 




A 


A 


A 


A 


A 


A 


A 


A 


A 


A 


A 


A 


A 


121 


n 

V 


n 


n 


n 


n 


n 


n 




o 

V 


n 


n 


Q 




o 

V 


n 


n 


n 


n 


Q 


n 


n 


n 


Q 


125 


j 


j 


j 


j 


j 


j 


j 


j 


y 


j 


j 


I 


j 


J 


y 


j 


j 


j 


I 


Y 


Y 


Y 


I 


133 
























s 




N 






N 


N 


s 








s 


1 


w 
w 


w 
w 


w 
w 


w 
w 


w 
w 


w 
w 


w 
w 


w 
w 


w 
w 


w 
w 


w 
w 


w 


w 
w 


w 
w 


w 
w 


w 
w 


w 
w 


w 
w 


w 


D 
K 


K 


D 
K 


w 


151 


A 


A 


A 


A 


A 


A 


A 


A 


A 


A 


A 


A 


A 


A 


A 


A 


A 


A 


A 


Y 


Y 


Y 


A 


157 


H 


H 


H 


H 


H 


H 


H 


Y 


Y 


H 


H 


H 


H 


H 


H 


H 


H 


H 


H 


H 


H 


H 


H 


168 


T 


A 


A 


A 


A 


A 


A 


T 


T 


T 
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tor affinity differences). They could therefore be directly 
linked with changes in the strains virulence (increase or de- 
crease) through a cellular tropism change. Another key fea- 
ture of this region was the N-glycosylation target sequence 
Ni66Ri67Ti 68 Ki69, as determined by NETGLYC 1.0. This 
precise sequence is only present in the 65% of the field 
strains, while in the remaining 35% there is an alanine resi- 
due (Ai6s)- Complementary bioinformatics studies performed 
on this region predicted only slight changes to the protein 
properties. 

On the other hand, the importance of two positions, 
strongly related to the N-glycosylation of JUNV and MACV 
was established. Also a high conservation degree of the resi- 
dues involved, along with their environ was found to be nec- 
essary [37]. In recent years, an analysis of LCMV proved the 
implication of each N-linked glycan of the glycoprotein with 
GPC expression, fusion activity and infectivity [38]. Interest- 
ingly, all the attenuated strains derived from XJ13 have an 
A i68 mutation. The hypothesis was that mutations present in 
field strains (natural variants) would be of minor importance 
in virulence attenuation. In fact, none of the type two muta- 



tion analyzed here were absent in field samples. Therefore, 
the data presented in (Table 3) could signify that the attenu- 
ated phenotype is not influenced by mutations in the Gl re- 
gion. That result concurs with Albarino and collaborators 
[39], who mapped the essential attenuation-related mutations 
toG2. 

NON CODING REGIONS 

Previous studies suggested that sequence changes within 
the intergenic region could play a role in the attenuation 
processes of arenaviruses [40]. Conversely, our analysis of 
the intergenic regions from Candid#l, XJ48, XJ#44, XJ39, 
XJ34, XJ17 and XJ13 revealed a total conservation among 
the strains for both genomic RNAs. If there is a low toler- 
ance for nucleotide changes in the region, it would suggest 
an evolutionary constraint, probably related to the calculated 
secondary structure and the associated transcription regula- 
tion function [32, 41]. Also, there was a high degree of con- 
servation of the non-coding regions at the genomic ends of 
the analyzed strains, with nucleotide homology values rang- 
ing from 93% to 97%. 
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However, when viral RNAs obtained from infected cells 
were analyzed, a great variability could be observed at the 3' 
and 5' non-coding regions of RNAs derived from the same 
parental strain. The analysis was designed to evaluate the 
genomic and antigenomic forms separately. A number of 5' 
non-templated bases were found for both Candid#l genomic 
segments. When comparing the 5' end of the genomic RNAs 
with the 3' end of the antigenomic RNAs (which are the 
template for the former), at least one additional guanine was 
present at all 5' ends of genomic clones with additional 
bases, as has been described for other arenaviruses [42]. On 
the contrary, the 3' RACE of genomic forms yielded several 
clones with short deletions. A calculation of RNA secondary 
structure from Candid#l S and L RNAs predicted the forma- 
tion of a panhandle between the 5' and 3' ends of each ge- 
nomic segment. The observed 3' deletions all mapped to 
regions within the panhandle (Fig. 3). 

Taken together, these results support the employ of cell- 
derived RNAs as primers for the synthesis of viral RNA, as 
well as 5' terminal sequences from virus-derived RNAs into 
incomplete panhandle structures as templates for the comple- 
tion of the 3' terminal sequences. The resulting heterogene- 
ity may well be a consequence of the action of different 
RNA editing mechanisms, as reported for other arenaviruses 
[43]. Still, the role of these regions in the process of attenua- 
tion remains to be established. 

Interestingly, a comparison of the sequence of the ends 
(3' and 5') of both genomic segments revealed several posi- 
tions that were highly conserved and could be part of the 
minimal promoter. The untranslated sequences on the ex- 
tremes of the S RNA, as well as the 5' end of the L RNA, 
have an approximate length of 80 nucleotides. This is not the 
case for the 3' end of L RNA, which has only 30 nucleotides. 
Considering that the genomic forms serve as templates for 
the antigenomic forms and vice versa, all non-coding termi- 
nal regions have to include the promoter sequence that is the 

L 
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recognition site of the viral polymerase, in order to achieve 
the completion of the replicative cycle. Since one of these 
regions only has a size of 30 nucleotides, it is reasonable to 
conclude that the minimal promoter sequence is contained in 
that range. In support of this, if a sequence comparison is 
made of the 80 nucleotides of the ends of S and L RNAs, the 
homology value is 60% for the 5' ends and only 50% for the 
3' ends. But, if the analysis is reduced to only 30 nucleo- 
tides, the value of the 3' region ascends to 71%. In fact, there 
is a 19 nucleotide sequence motif highly conserved among 
arenaviruses (arena region) that can be found in all JUNV 
genomic ends. The remaining 1 1 bases of the 3' end of the L 
RNA (GCTCAAGTGCC) have an elevated homology de- 
gree with two regions within the S RNA 3' end. These re- 
peated elements could be part of the translation or transcrip- 
tion processes. They have been found in other arenavirus 
species, in similar distributions. A comprehensive analysis of 
the genomic sequence of sublcade Bl New World arenavi- 
ruses [44] showed that the element that mapped to positions 
38-46 (GCUCAAGUG for JUNV L RNA and GCUCAGUG 
for S RNA) was conserved within the group [20] . Therefore, 
it could be possible to identify a sequence motif at the 3' end 
of both genomic RNAs of most arenaviruses. Also, in the 
particular cases of JUNV, Machupo and Tacaribe RNA S, 
the motif described as GSYC(A)i_ 2 GUR, has a relatively 
high positional conservation within the RNA secondary 
structure when calculated by bioinformatic tools. 

CONCLUSIONS AND SUMMARY 

This works aim was to present a condensed report of the 
studies about the molecular determinants of viral virulence in 
JUNV. A set of putative attenuation markers have been iden- 
tified within the GPC, Z and L ORFs of JUNV vaccine 
strain, Candid#l. Some of these positions were more confi- 
dently linked to the attenuated phenotype. The main reason 
for this is that these changes were found inside regions with 
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Fig. (3). Junfn virus Candid#l genomic end sequences. Predicted panhandle structures for the L and S RNAs. The region found in all 
3 'end sequences is shadowed. The sequence-logos show the nucleotide frequency determined by a RACE assey. A non-templated G is high- 
lighted at the end of the L segment. Taken from [13] 



422 Current Genomics, 2013, Vol. 14, No. 7 



Stephan et al. 



a low natural mutation frequency (GPC168, GPC427, GPC446, 
N 47 , L 76 , L 936 and Ln 56 ). They were analyzed with phyloge- 
netic and bioinformatic tools. 

Increasing complete genome sequence information avail- 
ability, particularly from JUNV strains, has been a signifi- 
cant contribution for the identification of further putative 
virulence attenuation markers. In recent years several se- 
quences were published for members of the vaccine strain 
lineage [20, 27]. They were included in previously done in- 
formatic analyses, confirming the results observed before, 
that is, only a few changes are necessary to obtain an attenu- 
ated phenotype. To further study the role in the virulence 
attenuation process of different point mutations, Albarino 
and collaborators developed a JUNV reverse genetics system 
[39]. Their results showed a direct involvement in the at- 
tenuated phenotype of certain point mutations within the 
GPC ORF of Candid* 1. 

Moreover, in order to analyze the natural mutation distri- 
bution, Goni and collaborators designed a RT-Nested-PCR 
based technique and applied it to human and rodent derived 
samples from the Argentine Hemorrhagic Fever endemic 
area [45]. The sequences thus obtained were translated in 
silico and analyzed with different tools. A comparison was 
instituted between the GPC derived sequence data (G frag- 
ment) and the results published by Albarino and collabora- 
tors [25]. (Table 3) summarized the findings of that analysis. 
As is expected, the greater variability appears mainly in the 
portion of the glycoprotein complex that is exposed to the 
exterior and therefore comes in contact with the immune 
system. Changes to the sequence and conformation of epi- 
topes could explain an increase in virulence due to escape 
from the natural host defenses. Also, slight variations within 
the receptor-binding region have been shown to confer the 
capability of infecting human cells as well as the animal host 
[46]. However, there are certain constraints acting on the 
outer part of the glycoprotein, related to its capacity of effi- 
ciently performing the original function within the viral cy- 
cle. 

Studies addressing similar questions for other arenavi- 
ruses produced interesting results. Zhang and collaborators 
[47] created reassortants using an attenuated and a virulent 
PICV parental strain. They found that the virulence in guinea 
pigs could mainly be attributed to the S segment. Neverthe- 
less, the virulence level of the parental strain was not at- 
tained, pointing to determinants being present in the L seg- 
ment. Similar studies made with LCMV strains mapped the 
attenuation markers of this virus principally to its L segment 
[31]. Although both studies determine the virulence in an 
animal model of guinea pigs, the results obtained for each 
virus are different. This suggests that each attenuation proc- 
ess produced a distinct set of mutations in different regions. 
It would be interesting to assay what these mutations are for 
JUNV, principally taking into account the available interme- 
diate strains of the vaccine genealogy, and the immense 
combination probabilities. 

The sequence information accumulated through the 
analysis of a number of strains with varying virulence de- 
grees will be the springboard for natural biodiversity studies. 
The mutations identified as potentially central to the attenua- 
tion process are ideal candidates for mutagenesis and reverse 



genetics assays. However, more complex studies will have to 
be made to determine the minimal changes that render an 
attenuated phenotype, since it is likely that many variations 
act synergically to decrease the observed virulence. Also, 
compensatory changes might arise to make up for structure 
or activity losses that hinder viral fitness [48, 49]. Therefore, 
even if a particular change is found in both field and vaccine 
strains, it does not necessarily negate its importance in viru- 
lence attenuation. Conversely, some variants present in natu- 
ral and vaccine strains might have a cumulative effect upon 
the activity of a single viral protein or on the capacity of in- 
teraction with other viral or cellular proteins. The Z protein 
shows a low number of variations. This protein is responsi- 
ble for the budding of the virus and the correct virion assem- 
bly. It also has several functions related to the modulation of 
the host immune response [50]. All of the roles of Z within 
the viral cycle are performed through interaction with pro- 
teins. The apparently low tolerance of changes within the Z 
ORF suggests that its functions greatly depend on the correct 
structure/sequence. Possibly, the regions of interaction of its 
partners are also submitted to a similar constraint. It will be 
interesting to analyze the precise regions once they have 
been positively identified. A great step towards identifying 
possible co-variation sites will come with the elucidation of 
the crystallographic structure of the viral polypeptides and 
their interaction partners. The beginning has been made with 
the first glycoprotein and partial nucleoprotein and L [51-53] 
structures and we are hoping for a steady advance in this 
area. 

Summing up, this work presented a set of mutations that 
are probably related with an attenuated virulence phenotype. 
Also, most of the mutations analyzed here (Type 1 and 2) 
were mapped to a few discrete genomic regions. The devel- 
opment of a RT-PCR based method followed by sequencing, 
was the basis for a broad analysis of JUNV strains in search 
of natural genomic variability. This method could further- 
more be useful if a program for the monitoring of vaccinated 
individuals is implemented. 
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